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With a statistical view towards information and noise, information theory derives ulti- 
mate limitations on information processing tasks. These limits are generally expressed in 
terms of entropic measures of information and correlations. Here we answer the quantum 
information-theoretic question: "How correlated are two quantum systems from the per- 
spective of a third?" by solving the following 'quantum state redistribution' problem. Given 
an arbitrary quantum state of three systems, where Alice holds two and Bob holds one, what 
is the cost, in terms of quantum communication and entanglement, for Alice to give one of 
her parts to Bob? The communication cost gives the first operational interpretation to quan- 
tum conditional mutual information. The optimal procedure is self-dual under time reversal 
and is perfectly composable. This generalizes known protocols such as the state merging and 
fully quantum Slepian-Wolf protocols, from which almost every known protocol in quantum 
Shannon theory can be derived. 

Information might be regarded as an answer 
to a question. To know if it is raining, one need 
only look outside. However, a person living in a 
desert would expect a different answer than one 
living in a climate where on average, it rains ev- 
ery other day. After looking outside, who gains 
"more" information? The answer to this ques- 
tion has nothing to do with weather - statisti- 
cally, the desert dweller learns less, owing to the 
general predictability of desert rain patterns. 

The statistical approach to quantifying infor- 
mation was taken by Claude Shannon l , who 
found that entropy plays a central role. By mod- 
eling the weather as a random variable X which 
is equal to "rain" or "shine" with probabilities 
p(x), the information gained by looking out the 
window (or rather, the uncertainty one has be- 



fore looking) is the Shannon entropy 
H{X) = - ^2p(x) log 2 p(x) 

X 

of X. Suppose that the weather on each day is 
independent of that on the previous day, and that 
the overal climate is the same each day. Ac- 
cording to Shannon's theory, the weather for n 
days can be described by the answers to roughly 
nH(X) yes-no questions, or nH(X) bits, so 
that H(X) is the average number of bits needed 
to describe the weather on any given day. The 
weather problem is an example of data compres- 
sion. Shannon showed that, provided the aver- 
age number of bits used to describe a source X 
exceeds H(X) by any amount, no matter how 
small, it is possible to reconstruct the source 
from that description with vanishing error. Oth- 
erwise, errors will be unavoidable. The only 
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caveat is that to decrease the error, it is necessary 
to compress a larger set of data in a single block. 
Data compression gives Shannon entropy an op- 
erational meaning, or interpretation, as the min- 
imum average number of bits required to faith- 
fully represent X. Shannon further defined the 
conditional entropy as 

H(X\Y) = H(XY) - H(Y), 

which is also equal to the average entropy of 
X given Y. Conditional entropy measures the 
information someone knowing only Y would 
have to learn in order to know X as well. Its 
operational relevance was shown 2 by Slepian 
and Wolf to be the minimum number of bits 
needed to represent X for someone who knows 
Y. Shannon also introduced mutual information 

I(X;Y) = H(Y) - H(Y\X) 

and conditional mutual information 

I(X; Y\Z) = H(Y\Z) - H(Y\XZ), 

each of which is interpreted as the information 
shared by X and Y, only the latter is measured 
from the perspective of someone knowing Z. 
Mutual information plays a fundamental role in 
characterizing the capacity for a noisy channel 
to transmit information 1 . Its conditional coun- 
terpart arises in the answers to many problems, 
such as in rate distortion with side information 
at the decoder 3 and communication with side 
information at the encoder 4 . It also appears in 
the analysis of degraded broadcast channels 5 . 
All four of these quantities can easily be shown 
to be nonnegative. 

In recent years, a quantum mechanical gen- 
eralization 6 of Shannon's theory has been un- 
der development. Here, a random variable is 
replaced with a quantum system C with den- 
sity matrix p c . The quantum analog of Shan- 
non entropy is von Neumann entropy H(C) P = 



— Tr p c log 2 p , which is the Shannon entropy 
of the eigenvalues of p° . While von Neumann's 
entropy preceded Shannon's by almost twenty 
years, its operational interpretation was only 
found relatively recently by Schumacher 7 , who 
showed that a large number n of quantum sys- 
tems, identically prepared in the state p c , could 
be compressed into a space of roughly nH{C) 
qubits, or two-level quantum systems. Here, 
a successful compression scheme is one which 
preserves the correlations C shares with the rest 
of the world, modeled by a reference system R. 
The combined system is considered to be in any 
pure state \ip) CR satisfying p c = Tr R \ip)(ip\ CR . 
We then say that Alice holds a purification of 
the reference R. Just as Shannon designated the 
abstract "bit of information", Schumacher pro- 
vided the "qubit of quantum information" as a 
fundamental unit quantum information. Quan- 
tum information cannot be known per se, al- 
though classical information about the identity 
of the quantum state can be inferred by making 
measurements. Rather, quantum information is 
something one can possess by having control 
over a quantum mechanical system which em- 
bodies it. It has been known for some time that 
quantum information cannot be copied 8 , so in 
contrast to the classical case, if Alice "knows" C 
and would like to "tell" it to Bob, not only must 
she send him at least H(C) qubits, she will lose 
possession of C in the process. 

The analogy can be continued, defining a 
quantum counterpart for each of Shannon's 
quantities by replacing Shannon with von Neu- 
mann entropies. Quantum mutual information 
9 I (A; B) can be considered as a measure of 
correlations between A and B. It plays a re- 
markably similar role as its classical counter- 
part, describing the classical capacity of a noisy 
quantum channel in the presence of free entan- 
glement 910 . It also has a direct operational 
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interpretation 11 as the smallest rate of classi- 
cal randomness which erases all correlations be- 
tween A and B. On the other hand, quantum 
conditional entropy H(A\B) and quantum con- 
ditional mutual information (QCMI) I (A; B\C) 
are less like their classical counterparts, as they 
cannot generally be viewed as averages. Fur- 
thermore, H(A\B) can be negative; —H(A\B) 
is often referred to as the coherent informa- 
tion 12 , which plays a role in characterizing 
the capacity of a quantum channel for trans- 
mitting quantum information 13 15 . The opera- 
tional task of state merging 16 gives meaning to 
H(A\B) where, depending on its sign, it corre- 
sponds to the rate at which entanglement is ei- 
ther consumed or generated while transferring 
A to someone already holding B. On the other 
hand, QCMI can be shown to be nonnegative. 
Unlike the classical case, this amounts to a the- 
orem, known as strong subadditivity of quan- 
tum entropy, whose original proof 17 relies on 
nontrivial tools from matrix analysis. More re- 
cently, operational proofs have been found n ' 16 , 
and we will see that our protocol leads to yet 
another such proof. Strong subadditivity is cor- 
respondingly powerful; it underlies virtually ev- 
ery known bound in quantum information the- 
ory. An early consequence of this result was a 
proof 18 of the existence of the entropy density 
for translationally-invariant quantum statistical 
models. More recently, strong subadditivity has 
found applications to geometric entropy 19 and 
conformal field theory 20 . Despite its central 
role, a direct operational interpretation of QCMI 
on an arbitrary state has been conspicuously ab- 
sent, although it has arisen in operational inter- 
pretations for certain restricted classes of under- 
lying states 21 . It is our primary focus to give 
such a general interpretation. 

The ability to send qubits from Alice to Bob 
is a resource, and Schumacher's theorem tells 



Ref 

® 


Alice 

•X* 


Bob 
(E) 


® 


(a) 





Figure 1 : Quantum state redistribution. 



how much of it is needed to transfer C. A 
weaker resource is entanglement, because it can 
be established by by sending qubits. A "stan- 
dard unit" of entanglement is called an ebit and 
consists of a single EPR pair = 4g (|00) + 

1 11)) shared between Alice and Bob. By quan- 
tum teleportation 22 , an ebit can be used to send a 
qubit, provided that two classical bits are sent as 
well. In the absence of classical communication, 
ebits are not helpful for moving C from Alice to 
Bob. They are helpful, however, in a variant of 
Schumacher's scenario in which Alice and Bob 
have some side information. We model this with 
four systems in the state \ip) ABCR , where Al- 
ice holds AC and Bob has B, and ask Alice to 
give C to Bob, as in Figure [H by sending qubits 
and using entanglement. In particular, no classi- 
cal communication is allowed, beyond what can 
be encoded in qubits. We allow the entangle- 
ment cost to be any real number, interpreting 
positive and negative values as in state merging. 
Our main result is that there exists a protocol - 
quantum state redistribution - allowing Alice to 
transfer C to Bob at a cost of Q qubits and E 
ebits if and only if 

Q > \I{C;R\B) 
Q + E > H(C\B). 

This gives the first direct operational interpre- 
tation of QCMI on an arbitrary state. Simulta- 
neously minimizing Q and Q + E leads to the 
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Figure 2: Region of achievable cost pairs, to- 
gether with the time-reversed optimal cost pair, 
assuming I(C;A) > I(C;B). 





Figure 3: Left: Unitary quantum state redistri- 
bution protocol in which Alice redistributes C 
to Bob while consuming entanglement (assum- 
ing that I(C;A) > I(C;B)). Right: Corre- 
sponding time-reversed process where Bob re- 
distributes C to Alice, this time generating the 
same amount of entanglement. 



optimal cost pair 

Q = \I(C-R\B) (1) 
E = \I{C-A)-\I(C-B). (2) 

This pair corresponds to the corner point of the 
region in Figure [2] As with Schumacher com- 
pression, this result is to be understood in the 
limit of many identical copies. Let us now point 
out some remarkable features of our result. 



Self-duality under time reversal: As illus- 
trated in Figure [H our protocol can be imple- 
mented unitarily - if entanglement is consumed 
by the protocol, reversing those unitaries leads 
to a protocol which instead sends C from Bob 
to Alice, while generating the same amount of 
entanglement. Perhaps surprisingly, this sym- 
metry is also evident in the optimal cost pairs: 
switching A and B reflects the optimal cost pair 
about the Q-axis (see Figure [2]). Thus, switch- 
ing A and B changes the sign of E in (O, but has 
no effect on the expression (OQ) for Q because the 
identity I(C; R\B) = I{C; R\A) holds on every 
pure state \tp) ABCR . In fact, our protocol can be 



considered as providing an explaination for why 
this identity should be true. 



Perfect composability: Suppose that Alice 
wants to transfer a composite system CD to 
Bob. An optimal strategy is for Alice to treat 
CD as a single system, sending them both si- 
multaneously using our protocol. The optimal 
cost pair for this is 

Q = \I{CD-R\B) 

E = \I{CD-A)-\I{CD-B). 

What if she sends the systems successively, as 
depicted in Figure HI? The optimal cost for first 
transferring D is 

Qd = \I{D;R\B) 

E D = \I{D-AC)-\I{D-B). 

Since Bob now has D, the remaining cost for 
sending C is 

Q c = \I{C-R\DB) 

E c = \I{C;A)-\I{C;DB). 
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Figure 4: Successive state redistribution. 

Simple algebraic manipulations then show that 
Q = Qc + Qd and E = E c + E D \ This fea- 
ture parallels successive refinement in classical 
rate-distortion theory 23 , only here the Markov 
condition is absent. 



Applications: Consider the following illustra- 
tive examples and applications of state redistri- 
bution: 

(1) Four-party cat state: The optimal cost 
pair for the state ^(|0000) + | 111 1>) we find 
that the optimal cost pair is Q = E = 0. To 
redistribute C from Alice to Bob, Alice applies 
the local isometry|0) (00| + |1><11|, while Bob 
applies its inverse. 

(2) Four-party W state: If the global state 
is i (| 1000) + |0100) + |0010) + |0001)), we 
obtain Q rs .38 and E = for the optimal 
cost pair. For comparison, a compress-and-send 
strategy which ignores the side information re- 
quires roughly .81 qubits. 

(3) States saturating strong subadditiv- 
ity: The states which require a zero rate 
of communication to redistribute C are pre- 
cisely those which saturate strong subadditivity 



(I(C;R\B) = 0), and are thus locally equiva- 
lent to a state of the form 24 

Y,^) a ^*) b ^^) AcBcC w*) ArBrR - 

X 

The entanglement cost for such states is 
ExPx(H(B c )^ x - H{Ac)fa). Another opti- 
mal strategy is thus to coherently concentrate 25 
the AcC\Bc entanglement while diluting 26 the 
A C \CB C entanglement in the individual states 

(4) State merging: Our state redistribution 
protocol allows for a deeper understanding of 
state merging 16 27 . By adding the additional re- 
source of free classical communication, we re- 
cover their result that the cost, in ebits, for merg- 
ing C to B is equal to H(C\B). Accounting for 
transmitted bits as well, state merging consid- 
ers A to be part of the reference, requiring that 
I(RA; C) bits be sent per copy of C merged. By 
our result, the classical communication cost is 
reduced to I(R; C\A) < I(RA; C), which can 
be shown to be optimal by an argument similar 
to the one we give for our protocol. Thus, QCMI 
can also be regarded as the classical communi- 
cation cost for state transfer in the presence of 
unlimited entanglement. 

(5) Fully quantum Slepian-Wolf (FQSW): A 
special case of our result is when Alice has no 
side information. An optimal strategy for this 
scenario has been found previously and called 
the fully quantum Slepian-Wolf protocol 28 ' 29 , 
which can transfer C from Alice to Bob using 
Q qubits and E ebits if and only if 

Q > \HC;R) 
Q + E > H(C\B). 

The optimal cost pair is 

Q = ±I{C;R), E = -\I{C-B). 
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(6) Fully quantum reverse Shannon (FQRS): 
If it is instead Bob who lacks side information, 
we obtain the previously studied 28 29 fully quan- 
tum reverse Shannon protocol as a special case 
of our result. Here, the required costs for trans- 
ferring C to Bob are given by 

Q > \I{C;R) 
Q + E > H(C). 

The optimal cost pair is dual to that of FQSW 
under time reversal 28 : 

Q = \I(C;R), E=\I(C;A). 



The protocol: Here we describe the proof that 
our protocol exists; a more detailed treatment 
(I.D. J.Y.) is forthcoming. First note that by 
the FQRS protocol, Alice can use \l(C; RB) 
qubits and \l(C; A) ebits to simulate the isom- 
etry which moves C to Bob, while keeping A to 
herself. In order to take advantage of Bob's side 
information, Alice can use a modification of that 
protocol which also transmits I(C; B) bits per 
copy of C which is moved 10 ' 30 . It is furthermore 
possible to make the classical communication 
coherent 3132 in the following sense. We say that 
Alice sends a coherent bit 31 to Bob if she ap- 
plies an isometry \x) A i— > \x) A \x) B to a qubit in 
her possession, where Bob has B and x = 0,1. 
Asymptotically, two coherent bits can be used to 
send a qubit and to generate an ebit 31 , so Alice 
can send an additional \l{C\ B) qubits, while 
generating the same number of ebits with Bob. 
This leads to a catalytic scenario, where extra 
ebits and qubits are needed to start the protocol, 
but are returned after completion. The depen- 
dence on the catalysts can be eliminated using 
methods in 33 . Subtracting the resources gener- 
ated by the protocol from those which were in- 
vested yields the optimal cost pair. On the other 
hand, the optimality of our protocol is shown in 



34 to follow from that of FQSW by subtracting 
the optimal costs for Alice to send only A from 
the costs for Alice to send AC. 



Discussion: For an arbitrary pure state 
\ip) ABCR , we have determined the communi- 
cation and entanglement resources which are 
necessary and sufficient for Alice and Bob, 
who respectively hold A and B, to transfer C 
between themselves while retaining the purity 
of the global state. The optimal communication 
cost gives the first operational interpretation of 
QCMI on an arbitrary state and also gives a 
natural interpretation to the pure state identity 
I(C;R\A) = I(C;R\B): the correlations 
between C and R look the same from each 
of Alice's and Bob's perspectives. Because 
of this, the communication cost is symmetric 
under time-reversal. On the other hand, the 
optimal entanglement cost was shown to be 
antisymmetric under time-reversal, so that if 
ebits must be consumed to move C one way, the 
same number can be generated while moving it 
back. 

There is a formal time-reversal duality be- 
tween FQSW and FQRS 28 . We showed that our 
protocol is self-dual in the same sense, while in- 
corporating both results as special cases. Inter- 
estingly, our coding theorem is based on a gen- 
eralization of that from FQRS, while both the 
coding theorem and the converse from FQSW 
are used for our converse. 

A corollary of our main result is a direct op- 
erational proof of strong subadditivity. Ours dif- 
fers from other such operational proofs 1116 be- 
cause it does not even rely on the subadditiv- 
ity of entropy, i.e. that H(A) > H(A\B). In- 
deed, before removing the dependence on cata- 
lyst channels, our protocol cannot simulate more 
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qubit identity channels than were initially pro- 
vided so that positivity of the overall qubit cost 
(QCMI) is evident. 

Our protocol is the first which involves an ar- 
bitrary four-partite pure state. As such, it can 
be applied as a fundamental primitive for all 
multi-party state redistribution problems. In- 
deed, whenever there is a sender (Alice) and 
a receiver (Bob), there are four natural subsys- 
tems: the system A which stays with Alice, the 
system B which Bob already has, the system 
C which is being communicated, and the rest 
of the world R. Even if there are many more 
parties, each particular round of communication 
fits into our setting. For instance, suppose Al- 
ice holds AC a, Bob has BCb and Charlie holds 
C, while all systems are purified into a refer- 
ence system R. If the goal is to transfer Ca and 
C B to Charlie, direct application of our result 
gives a four-dimensional region of achievable 
costs (Q A ^ C , Q B ^ C , E AC , E BC ), generated by 
two corner points, each corresponding to a dif- 
ferent order in which Charlie receives the sys- 
tems Ca and Cb- It is likely that other strategies, 
such as where Ca and C B are split into multiple 
subsystems and are sent to Charlie in various or- 
ders, would lead to even larger achievable re- 
gions. Furthermore, it is known 29 that FQSW, 
when combined with teleportation 22 and super- 
dense coding 35 , recovers virtually every known 
quantum Shannon-theoretic protocol. We ex- 
pect even more from state redistribution and are 
currently investigating its further implications 
for constructing more complex protocols and for 
understanding the structure of multipartite quan- 
tum states. 
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